Sentence encoding for Dialogue Act classification
نویسندگان
چکیده
Abstract In this study, we investigate the process of generating single-sentence representations for purpose Dialogue Act (DA) classification, including several aspects text pre-processing and input representation which are often overlooked or underreported within literature, example, number words to keep in vocabulary sequences. We assess each these with respect two DA-labelled corpora, using a range supervised models, represent those most frequently applied task. Additionally, compare context-free word embedding models that transfer learning via pre-trained language based on transformer architecture, such as Bidirectional Encoder Representations from Transformers (BERT) XLNET, have thus far not been widely explored DA classification Our findings indicate considerations do statistically significant effect accuracy. Notably, found viable sequence lengths, sizes, can be much smaller than is typically used experiments, yielding no improvements beyond certain thresholds. also show some cases contextual sentence generated by reliably outperform methods. Though BERT, its derivative improvement over approaches, previous work classification.
منابع مشابه
Unsupervised Approach for Dialogue Act Classification
This paper presents an unsupervised approach for dialogue act (DA) classification. We used a latent variable model to compress the dimensions of the feature vector. We introduced a paraphraser to reduce the variety of expressions and to solve the pragmatic problem for DA classification. The paraphraser seemed to work well on some DA classifications in the unsupervised approach. The results obta...
متن کاملCue-based dialogue act classification
Cue-Based Dialogue Act Classification Nick Webb Supervisors: Yorick Wilks & Mark Hepple In this thesis, we will address three research questions relating to the discovery and use of cue phrases for Dialogue Act classification. Cue phrases are single words, or combinations of words in phrases, that can serve as reliable indicators of some discourse function. In our case, we are looking to automa...
متن کاملActive Learning for Dialogue Act Classification
Active learning techniques were employed for classification of dialogue acts over two dialogue corpora, the English humanhuman Switchboard corpus and the Spanish human-machine Dihana corpus. It is shown clearly that active learning improves on a baseline obtained through a passive learning approach to tagging the same data sets. An error reduction of 7% was obtained on Switchboard, while a fact...
متن کاملLatent Semantic Analysis for Dialogue Act Classification
This paper presents our experiments in applying Latent Semantic Analysis (LSA) to dialogue act classification. We employ both LSA proper and LSA augmented in two ways. We report results on DIAG, our own corpus of tutoring dialogues, and on the CallHome Spanish corpus. Our work has the theoretical goal of assessing whether LSA, an approach based only on raw text, can be improved by using additio...
متن کاملDialogue act detection in error-prone spoken dialogue systems using partial sentence tree and latent dialogue act matrix
In a goal-oriented spoken dialogue system, the major aim of spoken language understanding is to detect the dialogue acts (DAs) embedded in a speaker’s utterance. However, errorprone speech recognition often degrades the performance of the SLU component. In this work, a DA detection approach using partial sentence trees (PSTs) and a latent dialogue act matrix (LDAM) is presented for spoken langu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Natural Language Engineering
سال: 2021
ISSN: ['1469-8110', '1351-3249']
DOI: https://doi.org/10.1017/s1351324921000310